Markdown Monster icon

About

I am Jinpu Cao, a graduate student of the Sustainable Design and Construction Program in the Civil and Environmental Engineering Department at Stanford University (2020 - 2022). My research interests lie in applying data-driven methods and Artificial Intelligence (AI) to Architecture Engineering and Construction (AEC) to help us create smart and sustainable infrastructures and living environments.

Education Experience

Conferences

  1. Speaker in the 4th International Conference on Information Technology in Geo-Engineering (4ICITG)

    The conference was organized by Geotechnical Society of Singapore in August 2022.

    Presentation: “A Long-term Probabilistic Forecasting Approach of TBM Operating Parameters based on Deep Learning.

  2. Speaker in the II International Geo-science Machine Learning Big Data Seminar.

    Hosted by ISSMGE Technical Committee of Machine Learning and Big Data (TC309) Risk and Insurance Research Branch of China Civil Engineering Society in July 2019, Shanghai.

    Presentation: “Predicting TBM Performance using Machine Learning: is Surrounding Rock Information Important.

Publications

  1. ” An LSTM-based model for TBM performance prediction and the effect of rock mass grade on prediction accuracy ” (China Civil Engineering Journal, first author, accepted)

  2. “PigSense: Vibration-based Activity and Health Monitoring System for Pigs” (ACM Journals, co-author, in review)

  3. “Improving adaptation to wildfire smoke and extreme heat in vulnerable communities: Evidence from a pilot study in the San Francisco Bay Area” (Environmental Research Letters, contributor, ready to submit)

Research Projects

  1. Computer Vision-based Pavement Distress Detection System (March 2022-Now)

    Key Words: Computer Vision, Crack Detection, Web Development, Business Value, Deep Learning     (Python, TensorFlow, Dash)

    Worked as a research assistant advised by Pooja Jain (V.P. Strategic Innovation in WSP) and Dr. Martin Fischer (Professor at Stanford).

    This is a significant exploration of developing the practical workflow of a vision-based automatic pavement distress detection system and software kit, and I am also responsible for discovering the potential business value of the process.

  2. Pilot Air Quality Analysis in the Bay Area (Mar 2022 - Now)

    Key Words: Data Manipulation, Time Series Decomposition, Clustering, Regressive Analysis, Community-engaged     (R, MATLAB)

    Worked as a research assistant advised by Derek Ouyang (Research Manager at the RegLab, Stanford) and Dr. Gabrielle Wong-Parodi (Professor at Stanford).

    This community-engaged pilot study aims to improve adaptation to wildfire smoke and extreme heat in vulnerable communities. I am responsible for characterizing these communities’ air quality and exploring their human-induced and environment-induced influence on it. The team I led characterized the relationship between indoor and outdoor air quality with a spike lag model and verified it by actual monitoring data, which will be a vital contribution to our new journal paper.

  3. PigSense: Structural Vibration-based Activity and Health Monitoring System for Pigs (Jan 2022 - Sept 2022)

    Key Words: Signal Processing, Classification, Machine Learning     (MATLAB, Python, scikit-learn)

    Worked as a researcher advised by Dr. Hae Young Noh (Professor at Stanford) in Stanford Structures as Sensors Lab

    This project introduces the first system to track animals using structural vibrations, and the first system for automated characterization of piglets’ group activities, including nursing, sleeping and active times. I have realized the classification of piglet group activities using structural vibration data on machine learning methods.

  4. Long term probability prediction platform of urban water consumption (Oct 2020 - Sept 2021)

    Key Words: Time Series, Probabilistic Prediction, DeepAR     (Python, GluonTS)

    Worked as a research assistant advised by Dr. Fang Liu (Professor at Tongji University)

    This was a practical and valuable project to build an intelligent diagnosis, risk reasoning and decision support system for municipal facilities. I developed a water consumption probability prediction model based on a deep autoregressive model (DeepAR) independently, and the model has been partially integrated into the Shanghai Smart Intelligent Platform to predict daily water consumption and provide pipe burst early warning services.

  5. Tunnel Boring Machine (TBM) Operation Parameters Prediction (Jan 2019 - Sept 2019)

    Key Words: TBM, Parameter Prediction, LSTM, Data Augmentation     (Python, Keras)

    Worked as a research assistant advised by Dr. Fang Liu (Professor at Tongji University)

    The project aims to predict TBM operation parameters based on its historical data, promoting safe and efficient tunneling construction of TBM. I independently developed TBM parameters and geological prediction model based on Long Short-Term Memory (LSTM) network , and won the National Third Place in the data mining competition organized by the Chinese Society of Rock Mechanics and Engineering.

  6. Battery Lifetime Prediction with Limited Cycle Data (Sept 2020 - Dec 2020)

    Key Words: Battery Lifetime Prediction, CNN, Bi-LSTM, Confidence Interval     (Python, TensorFlow, PyTorch).

    Accurately predicting the remaining useful lifetime of batteries is critical for accelerating technological development and creating a paradigm shift in battery usage. Data-driven approaches,based on large datasets, provide a physical-model agnostic way to predict the health status of batteries with high accuracy. However, most datadriven methods on battery life prediction often rely on features extracted from a hundred cycles worth of data for a given cell, making it computationally inefficient and incompatible with on-board application.

    The course project (CS 329P Practical Machine Learning) applied machine-learning models, including linear regression, random forest regression, convolutional neural networks, and recurrent neural networks to make predictions on cell life. Our best model achieve a 7.5% prediction error given the data of only 5 cycles.

  7. Sustainable Urban System Projects (Oct 2021 - June 2022)

    Key Words: Data Manipulation and Visualization, Geospatial Data, Census Data, Equity analysis, Monte Carlo simulations, Regression, Causality Analysis, Web Application     (R, Shiny).

    The Sustainable Urban Systems (Stanford CEE 218, Shaping the Future of the Bay Area) emphasis merges traditional data analytics with complex systems analysis to better inform decisions around the wicked problems of urban development like urban land use, mobility, sustainability and hazard analysis.

  8. Machine Learning Contest: Infrared Spectrum Classification (July 2022)

    Key Words: Infrared Spectrum Classification, Machine Learning     (Python, AutoGluon).

    The Near Infrared Spectroscopy Branch of the China Instrument Society holds the data modeling contest. The organizer provides a set of near-infrared spectral data from the actual application scenario. Different data preprocessing technologies (e.g. PCA, normalization) and machine learning models (AutoGluon) were tried for the unfrared spectrum classification problem.

Honors

Stanford University

    2020-2022 Outstanding Project of SFBI

Tongji University

    2017-2020 Outstanding Graduates in Shanghai (Top 5% of 220,000 students)

    The First Prize Merit-Scholarship of Tongji University (Top 5% of 400 Students)

    Shanghai Scholarship (Top 5% of 220,000 students)

    Excellent Students of Tongji University (Top 3% of 4000 students)

    The Second Prize of Structure Design and Model Competition

    National College Students’ Mathematics Competition 2016-2017 The Second Prize of Shanghai Division

    National College Students’ Mathematical Modeling Competition 2016-2017 The Third Prize in the Shanghai Division

Internship

Shanghai Shentong Metro Group (Jun 2019-Aug 2019)

Subway Monitoring Intern

This was an interesting practical expedition to assist in monitoring and analyzing the subway deformation and settlement near an ultra-deep foundation pit (Xuhui Center, Shanghai). I processed and visualized the subway deformation monitoring data based on python and proposed corresponding corrective measures to the construction unit.

CV & Bio

Download my CV here. Last updated Oct 2022.

Email:

GitHub: https://github.com/J-i-n-p-u

Twitter: @Jinpu_C

Linkedin: https://www.linkedin.com/in/jinpu-cao-a003bb1b6